Improvement of the jpHMM approach to recombination detection in viral genomes and its application to HIV and HBV

نویسنده

  • Anne-Kathrin Schultz
چکیده

Accurate virus genotyping and the detection of recombinant strains are of crucial importance for understanding viral evolution as well as the design of potential vaccines and treatment strategies. A very accurate tool for detecting recombinations in genomic HIV-1 sequences is jpHMM (jumping profile Hidden Markov Model). For a given sequence, it predicts recombination breakpoints and assigns a parental subtype to each segment in between two breakpoints. In this thesis, modifications and extensions of jpHMM are carried out to improve the reliability of the recombination prediction, to reduce the runtime of the program and to allow the analysis of recombinations in circular genomes. As incorrect subtype assignments or recombination predictions may lead to wrong conclusions in epidemiological or vaccine research, it is important to assess the reliability of the predicted recombination in a particular sequence. For this reason, the output of jpHMM is extended to include a tagging of regions where the model is uncertain about the predicted subtype and an interval estimate for each predicted breakpoint. It is shown that this extension strongly improves the reliability of the recombination prediction. To allow an efficient application of jpHMM to large data sets or species with a large number of subtypes, the complex architecture of the model is substantially modified. Evaluation on HIV-1 as well as hepatitis B virus (HBV) data shows that these modifications lead to a considerable reduction of the runtime of the program. Furthermore, an extension of jpHMM to detect recombinations in viruses with circular genomes such as HBV is introduced. Recombination analysis in circular genomes is usually done on artificially linearized sequences using linear models. Since these models are normally unable to model dependencies between nucleotides at the 5’ and 3’ end of a sequence, this can result in inaccurate predictions of breakpoints and thus in incorrect classifications of circular genomes. In contrast, the circular jpHMM takes into account the circularity of the genome. Its accuracy is evaluated on a large set of recombinant HBV sequences. Additionally, about 3000 full-length HBV sequences are studied to detect socalled circulating recombinant forms (CRF). For this, certain criteria for classifying recombinant HBV sequences are proposed. Based on these criteria, 17 CRFs can be identified.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

jpHMM: recombination analysis in viruses with circular genomes such as the hepatitis B virus

jpHMM is a very accurate and widely used tool for recombination detection in genomic sequences of HIV-1. Here, we present an extension of jpHMM to analyze recombinations in viruses with circular genomes such as the hepatitis B virus (HBV). Sequence analysis of circular genomes is usually performed on linearized sequences using linear models. Since linear models are unable to model dependencies ...

متن کامل

Point Prevalence of Hepatitis B Virus Infection among Adolescents in Visakhapatnam, India

Introduction: Occurrence of detectable amounts of viral antigen or viral particles in the blood of infected patients with Hepatitis B virus (HBV) is a significant characteristic of HBV infection. Detection of HBV antigen or its DNA among individuals of a community is a crucial factor to know the burden of HBV infection. Hepatitis B surface antigen (HBsAg) is a suitable marker of HBV infection b...

متن کامل

jpHMM: improving the reliability of recombination prediction in HIV-1

Previously, we developed jumping profile hidden Markov model (jpHMM), a new method to detect recombinations in HIV-1 genomes. The jpHMM predicts recombination breakpoints in a query sequence and assigns to each position of the sequence one of the major HIV-1 subtypes. Since incorrect subtype assignment or recombination prediction may lead to wrong conclusions in epidemiological or vaccine resea...

متن کامل

Association of Tomato Leaf Curl New Delhi Virus, Betasatellite, and Alphasatellite with Mosaic Disease of Spine Gourd (Momordica dioica Roxb. Willd) in India

Background: Spine gourd (Momordica dioica Roxb. Willd) is one of the important cucurbitaceous crops grown across the world for vegetable and medicinal purposes. Diseases caused by the DNA viruses are becoming the limiting factors for the production of spine gourd reducing its potential yield. For the commercial cultivation of the spine gourd, propagation material used by most o...

متن کامل

A Rare HBV Subgenotype D4 with Unique Genomic Signatures Identified in North-Eastern India –An Emerging Clinical Challenge?

BACKGROUND/AIMS HBV has been classified into ten genotypes (A-J) and multiple subgenotypes, some of which strongly influence disease outcome and their distribution also correlate with human migration. HBV infection is highly prevalent in India and its diverse population provides an excellent opportunity to study the distinctiveness of HBV, its evolution and disease biology in variegated ethnic ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2011